Multimodal Summarization of User-Generated Videos
نویسندگان
چکیده
The exponential growth of user-generated content has increased the need for efficient video summarization schemes. However, most approaches underestimate power aural features, while they are designed to work mainly on commercial/professional videos. In this work, we present an approach that uses both and visual features in order create summaries from Our produces dynamic summaries, is, comprising “important” parts original video, which arranged so as preserve their temporal order. We use supervised knowledge aforementioned modalities train a binary classifier, learns recognize important Moreover, novel dataset contains videos several categories. Every 1 s part each our been annotated by more than three annotators being or not. evaluate using classification strategies based audio, fused features. experimental results illustrate potential approach.
منابع مشابه
Multimodal Semantics Extraction from User-Generated Videos
User-generated video content has grown tremendously fast to the point of outpacing professional content creation. In this work we develop methods that analyze contextual information of multiple user-generated videos in order to obtain semantic information about public happenings (e.g., sport and live music events) being recorded in these videos. One of the key contributions of this work is a jo...
متن کاملFast Summarization of User-Generated Videos Using Semantic, Emotional and Quality Clues
This paper introduces a novel approach for fast summarization of user-generated videos (UGV). Different from other types of videos where the semantic contents may vary greatly over time, most UGVs contain only a single shot with relatively consistent high-level semantics and emotional content. Therefore, a few representative segments are generally sufficient for a summary, which can be selected...
متن کاملSummarization of ICU Patient Motion from Multimodal Multiview Videos
Clinical observations indicate that during critical care at the hospitals, patients sleep positioning and motion affect recovery. Unfortunately, there is no formal medical protocol to record, quantify, and analyze patient motion. There is a small number of clinical studies, which use manual analysis of sleep poses and motion recordings to support medical benefits of patient positioning and moti...
متن کاملPredicting Emotions in User-Generated Videos
User-generated video collections are expanding rapidly in recent years, and systems for automatic analysis of these collections are in high demands. While extensive research efforts have been devoted to recognizing semantics like “birthday party” and “skiing”, little attempts have been made to understand the emotions carried by the videos, e.g., “joy” and “sadness”. In this paper, we propose a ...
متن کاملTitle Generation for User Generated Videos
A great video title describes the most salient event compactly and captures the viewer’s attention. In contrast, video captioning tends to generate sentences that describe the video as a whole. Although generating a video title automatically is a very useful task, it is much less addressed than video captioning. We address video title generation for the first time by proposing two methods that ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2021
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app11115260